



FIGURE 3A 



tbpB 



10 
40



50 
CGCTTGCAGA TTTGTAAAAA


ATTTAGCTAA 


AATCAGACCT 


GGCTTGTATT


51 


TTAGGGTTAT 


TATGGAACAG 


ACAACGGAAC 


AAATAGATTA 


TAAATTATTA 


101 


AAGCATCGTT TTCGTGGCTA 


TTTACCGCTT 


GTGATTGATG 


TGGAACAGAG


151 


GCTTAAATGC 


CCAAACTGTG 


CCTTATTGGA 


ATTGGCCGGA 


ATTACTTTAA 


201 


AATTGGACGA


GCCAAGGTTA 


TCTTATTGCC 


GGATTCAACA 


ATGCCAATTT 


251 


CCAATTTGCG


GCCATTGGAG 


GGGGCGATTT 


TCAAGGGCCG 


AATATTTACC 


301 


CTCAGTCATT


AAAATTCAAC 


GGCATTCATA 


TTCACAACCC 


TCTAGAGGAA 


351 


GCGGTTTAGA 


GAATATTGCC 


CAATTCCCGG 


AAATGTTCAA 


AATGGTTCGC 


401 


AAAGCAATGA 


AGGAAGCAAG 


GCTGCCAACG 


GGCGGTGATT 


GTTGCCCCAC 


451 


AATGCCAGCG 


TTCGATCAGG 


GCTTTTTTAC


AGGGCTGCGA 


TTAAACGGAA 


501 


TGCCAAGCGA 


GAGATCCCTT 


TCACCCTTTT 


GGCGATGTTT


GACTACCGCC 


551 


GCCACCCTTT 


GCAGGTTTTA


TGTTATGGGC 


AAACCGTGTT 


CGTTAAAAGC 


601 


CTGCCAGTGC 


AAAATCCGTT 


CACGTAGCAG 


CACTATAGGG 


CGAATTGGGT 


651 


ACCGGGCCCC 


CCTCGAGGTC 


GACGGTATCG


ATAAAGCTTC 


ATATCGAATT 


701


CCTGCAGCCC 


GGGGGATCCG 


ATGCGCCTTG


CGGCTCAAGT 


TATTAGTGGC 


751 


ATCGGTTTTT 


TGGGTGATGG 


TGTCATTTTG 


CATAAGAAAA 


ATGATGCGAT 


801 


TTCAGGTTTA 


CCCACTGCGG 


CGATTATTTG 


GGCTTCTGCG 


GGGATCGGTA 


851 


TTGCTGCAGG 


GGAGGGTTTC


GTGTTTCATG 


CGGTCATCGC 


CACTGTCATT 


901 


ATTTTGGTGT


CTATTCGATT 


ATGTCCGTTG 


GTTCAACGTT 


GGGTTCATCG 


951 


TAAATCACAA 


CGTCGTCGAC 


GAAATATTCT 


TGTCAATGAT


GCGGAAAGCA


1001 


TACGGAAAGT 


TACCCAATTG 


TTATTCAATA 


ATCAGTATCG 


TATTGAACAT 


1051 


ATACAAGTCA 


AAGATCAAAG 


TAGTGGAGAA 


GTTGCCGGTT 


ACAAATTCGT 
1101




CAATGTTCftA 


AGATGCGTAT 


GCTTTACTTA 


AAGCAGAAGA


1151


TC5GCGAATAC 


TGTTCAAGTA 


GATATCATGA 


AAAAGAGTGC 


TTATATTCAA 


1201




TTTATTTAAT 


TTCTTTCCAC 


AAAAGATCAT 


TTTCAATTAT 


1251


ATATACTGGA 


ATTTTGCCTT 


ACGCTATCTG


TCATTTATTA 


TGCTATTCAA 


1301




CTATGGAACA 


ACAATCAAAA 


TGTACGCTTA 


TCGGCTGCCG 


1351


A Tn A TTTCIG A 


TAATGATCGA 


TGTGCAGAAA 


TATTTGAACT 


TACGATTTTC 


1401


ACTGAGCAGT 


CAAAGCACGT


TCGCGAGTAT


CGACTCTACT 


TTATTTATCG 


1451 



TTTGTGCACT 


ATGTATAATC 


CACCTAATTC 


CGTGCCTTGG


CCATAAAAGC 


1501 


CCCCTTCAAA 


TTGTATTTAT 


ATCAGCTACC 


GTGCCACCAT 


TCGTACTTTT 


1551 


CGGATCAAGA 


TTAAAACAGA 


ATCCCTGCAT 


GCACAGCGAA 


ATCAGCTGCG 


1601 


GTATGCGAAA 


CGCCGCAGGA 


GCGCAGTACG 


CGAAGTGTAC 


CGTCACTATC


1651 


AGTGCTAGAT 


TTGTCAATAA 


AAAATTAGTG 


ACCAAGCTTG 


GGTGCATAAT 


1701 


GATGGTGATG 


AAAGAACGCT 


CAATGCTTGA 


CACGTTGCAG 


GCTATCTGTA 


1751 


AGGGTATGGT AGTTACAGGC 


ACAGCCCAAA


CGGCCAATTG 


CTGGTTTTTT 


1801 


ATCCTTGATC 


CGAACAGTAC 


GAATGGTGGT 


CACGGTAGTG


CATATACATA 


1851 


CTCAATTGAA 


GGGGGCTTTT 


ATGGCCCTAA 


GGCGACGGAA 


TTAGGTGGTA 


1901 


TTGTACATAG 


TGCAGAAACG


GATAAAGATA


GAGTCAGTAT 


TACATTCGGC 


1951 


GGAAAACGTC 


AAATAGAAAA 


ATAATCATAA 


TTCCCCTTTG


CTGGTTGTAG 


2001 


ATAGCAGCGG 


GCAATTTTTT 


ATAAAAATTT 


GCAAAATTTA 


AATAA 
10



1 AGACCCTATC 
51 TGCCTTAACA 
101 AAAATAAAAA 
151 ATTGTGACAG 
201 CTTGGGGAAA 
251 TTGGTATTCG 
301 CAAGGTCGCG
351 CCGTGTCAGC 
401 CGCTGGTTCA 
451 ACATTCGTTC 
501 TCTGGTGCGC 
551 TATTATTAAA 
601 CCAGCAAAAA 
651 GTGGTTTTCA
701 AAAATTCATT 
751 CTTTCAAAAT 
801 AGGATCTTTT 
851 CTCGTGCAAG 
901 CGAATATACG 
951 GAGCTCTCAG 
1001 CTTTAGATTA 
1051 ACCTCGTCTC 
1101 ACGATATCCG 
1151 TACTTATCAC 
1201 TTAGTGTTCA 




FIGURE 3C 
1

TAATGATAAT 


GAAATATCAT 


CATTTTCGCT


1 

ATTCACCTGT 


GTGTTATTTG 


CTCTTTCTCA 


TTCATACGGT 


GCTGCGACTG 


AATCGAAGAA 


AATAACGATC 


TAGCTGTTCT 


GGATGAAGTT 


AGAGCCATTA 


TGCTCACGAA 


CGTCAAAACG 


AAGTAACTGG 


GTAGTGAAAA 


ATTATCACGA 


AATGAGTAAA 


AATCAAATTC 


TGATTTAACT 


CGCTATGACC 


CTGGTATTTC 


GGTGGTGGAA 


GTGCAAGTAG 


TGGCTATGCC 


ATTCGAGGTG 


TAGATAAAAA 


TTACTTGTTG 


ATGGGCTACC 


ACCAGCGCAC 


AGTTATCATA 


GATGCTAATG 


GTGGTGCAAT 


TAATGAGATT 


GAGTATGAAA 


AATTGAGTTA 


AGCAAAGGAG 


CAAGTTCTGC 


GGAATATGGC 


ATGGTGGTGC 


TATTGGTTTT 


CGTACTAAAG 


ATCGCCAGGA 


GAGGGGCAGC 


ATTGGGGCTT 


ACATAGTAAG 


ACCTCTTATG 


TAGCCATTTT 


TACAGTCTAT 


CGCAGCGGCT 


GGTCAGGCGG 


AGCACTTGTT 


ATTGCAACTC 


ACCGACACGG 


TAAAGAGACC 


CCGAGGCAAA 


TCAATTACAT 


ATTATTCGGC 


GTATAACCGG


CGCTACGACT 


TTACCCAATT 


CCGCACAGAA 


TGCCTCCTGG 


TTTATTGTGG 


AAGATACTTG 


CCCAACATTA 


GATTGTACTC 


GGTTAAGTTG 


AACGCGATAA 


TTTCCCAGTC 


AGAACATTTC 


CCTGGAAGAG 


GCGAAACAGC 


TTGAGATTCC 


TTATCGCACT 


CCCAAAGAAT 


ARACCGGTAA 


AGATCGCATT 


GCACCAAACC 


CAAGAGTAAT 


TCTGTTTTTA 


TGAAGTTTGG 


CTATCACTTT 


ATTATCTTGG 


CGCATCTCAC 


AAGATGATAC 


AAAACAACGC 


TCATSTGCAA 


ACGCCAGCTT 


ACTATACAAA 


AGACGATATT 


TTTGGAACTA 


TGTTTATCAA 


GGGGATATTA 


TTTAGATGGC 


AGCCAAGGAT 


CCCTTATGGG 


TTGCGCATAT 


GCCATGTGAA 




1251 


CGTCACCACA 


AACGTCGTTT 


AGGATTCACC 


TATAAATATA 


AACCAGAGAA 


1301 


TAATCGCTGG 


TTGGATAGCA 


TTAACTCGTG 


CGTACGTGCT 


TTGCGCTCTC 


1351 


GCTGCTGTGC 


TCTGAGTAAA 


CAAGATATTG 


AACTATATAG 


CCGGCTACAT


1401 


CGCTTGCATT 


GTAGCGATTA


TCCTGTGGTA 


GATAAAAATT 


GCGGCCCGAC 


1451 


TTTGGATAAA 


TCTGGTCTAT 


GTATCGAACT 


GAGCGTAATA 


ATTACCAAGA 


1501 


AAAGCATCGT 


GTCATTCATT 


TAGAATTTGA 


TAAAGCGCTA 


AATGCTGGTC 


1551 


AAGGCGTATT 


TAAGCAAACC 


CACAAACTGA 


ATTTAGGCTT 


GGGCTTTGAA


1601


TCGATTAATC


GCTTATGATC 


ATGGGGATAT 


GACTGCCCAA 


TATACCAAAG 


1651 


GCCGGTTATA 


CCAGCTACCG 


CGGAGAGGGG 


CTTTAGATAA 


TCCATATATT 


1701 


TATCGCCGCG 


ATCCACGCAG 


TATTGAAACG


GTATCTTTGT 


GTAATAATAC 


1751 


AGCGGCGACA


CTTAACTGTG


ACGCGTTAAA 


TAAAGGCATA 


CGTTTGTACC 


1801 


TCCGCTGCAC 


TTAGGAACTA 


TAGTTTATGA 


AGGGGATAAT 


ATTTAGATGG 


1851 


CTTAGTGTTC 


AAGCAAGCAA 


GGATCCCTTA 


TGGGTTGCGC


GATATGCCAT 


1901 


GTGAAGTTTT 


TGATGAACGT 


CACCACAAAC 


GTCGTTTAGG 


ATTCACACCT 


1951 


ATAAATATAT 


AAACCAGAGA 


ATAATCGCTG 


GTTGGATAGC 


ATTAACTCGT 


2001 


GCGTACGTGC 


TTTGCGCTCT 


CGCTGCTGTG


CTGTGAGTTA 


AACAAGATAT 


2051 


TGAAACTTAT 


AGCCGGCTAC 


ATCGCTTGCA 


TTGTGAGCGA 


TTATCCTGTG 


2101 


GTAGTAGTAA 


AAATTGCGGC 


CCGACTTTGG


ATAAATCTTG 


GTCTATGTAT 


2151 


CGAACTGAGC 


GTAATAATTA 


CCAAGAAAAG 


CATCGTGTCA 


TTCATTTAGA 


2201 


ATTTGATAAA


GCGCTAAATG 


CTGGTCAAGG 


CGTATTTAAG


CAAACCCACA 


2251 


AACTGAATTT 


AGGATTGGGC 


TTTGAATCGA


TTAATTCGCT


TATGGATCAT 


2301 


GGGGATATGA 


CTGCCCAATA 


TACCAAAGGC 


CGGTTATACC 


AGCTAACCGC


2351 


GAGAGGGCGT 


TTAGATAATC 


CATATTTATC 


GCCGCGATCC


ACGCAGTATT 


2401 


GAAACGGTAT 


CTTTGTGTAA 


TAATACACGC 


GCGACACTAA 


CTGTGACGCG 


2451 


TTAAATAAAG 


GCATACGTTT 


GTACCTCCGC 


TGCTGCCTAA 


TAAATCAAAA 


2501 


GAATAACCGA 


GATACGGTTC 


AGTGTTGTTC 


CAACCAGTTG 


CGATGGCCCA 



FIGURE 3E 



2551 CTACGTGAAC CATCACCCTA ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA
2601 AGCACTTAAC CCTTCTGTCG TCTCCCGTGG ATGCTTAAAT TCGCAGCTAG
2651 TCGCAGGCAG GCACGTCACT CCTCTCGGTG ATTTCAGGTG CAACTGACCG
2701 GTTCTTGGTA CCACCCTTGA TATTAACCGG AGTCAATTAT AAAAACGAGT
2751 TACGTGGAGC GCAATTTATA ATGTCGATGT CAGATACTGT AAAACTCTAT
2801 ATTACCGTGG GCAGCAATTA GGTGACAGGG CCACGGGGCA AGCGAAACCA 
2851 GACGGGTACC AATTACACCG ATTTGCCGCC CCCGGGAGAG AAATTTCAGT 
2901 TACCATTCAA AGAAGTTTAG AGCCGGCCAA AAGAAAATAC AAAAAACGCT 
2951 GAAAGTATAT TCAGCGCGTT TTTGTTGCTC TAACGGATTA CATACGAATT 
3001 CAAAATGTTT TAACGGTCGG TTA 



FIGURE 5 



M 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16



23.1 kb - 

9.4 kb - 

6.6 kb - 

4.4 kb - 

2.3 kb - 

2.1 kb - 
FIGURE 6








FIGURE 7 



M 1 2 3 4 5 6 7   M 1 2 3 4 5 6   M 1 2 3 4 5 6 7 8



23.1 kb - 

9.4 kb - 

6.6 kb - 

4.4 kb - 

2.3 kb - 

2.1 kb -



FIGURE 9A 



PHTBP MIMKYHHFRYSTVALTVLFALSHSYGAATENKKIEENNDLAVLDEVIVTE 50

NGTBP1 MQQQ-HLFRLNILCLSLMTALP-AYAENVQAGQAQEKQLDTIQVKA 44

NM1 MQQQ-HLFRLNILCLSLMTALP-VYAENVQAEQAQEKQLDTIQVKA 44



* * * 



PHTBP SHYAHERQNEVTGLGKVVKNYHEMSKNQILGIRDLTRYDPGISVVEQGRG 100

NGTBP1 KKQKTRRDNEVTGLGKLVKTADTLSKEQVLDIRDLTRYDPGIAVVEQGRG 94
NM1 KKQKTRRDNEVTGLGKLVKSSDTLSKEQVLNIRDLTRYDPGIAVVEQGRG 94



* ******** ** 



** * * *********** ******* 



PHTBP ASSGYAIRGVDKNRVSLLVDGLPQAHSYHT LSGD ANGGAINEIE 144 

NGTBP1 ASSGYSIRGMDKNRVSLTVDGLAQIQSYTAQAALGGTRTAGSSGAINEIE 144
NM1 ASSGYSIRGMDKNRVSLTVDGVSQIQSYTAQAALGGTRTAGSSGAINEIE 144



***** *** ******* *** * ** 



******* 



PHTBP YENIRSIELSKGASSAEYGSGAHGGAIGFRTKDAQDIIKEGQHWGLDSKT 194 

NGTBP1 YENVKAVEISKGSNSVEQGSGALAGSVAFQTKTADDVIGEGRQWGIQSKT 194
NM1 YENVKAVEISKGSNSSEYGNGALAGSVAFQTKTAADIIGEGKQWGIQSKT 194



*** 



* *** ***** * ******** ** *** 



PHTBP SYASKNSHFLQ-IAAAGEAGGFEALVIATHRHGKETKIHSEANKLKHNIR 243 

NGTBP1 AYSGKNRGLTQSIALAGRIGGAEALLIRTGRHAGEIRAHEAAGRGVQSFN 244
NM1 AYSGKDHALTQSLALAGRSGGAEALLIYTKRRGREIHAHKDAGKGVQSFN 244



* *** ******** 



* * 



PHTBP RITGFENRYDFTQIPHRMPPGGS FFIVEDTCPTLDCTPRARVKLNRD 290

NGTBP1 RLAPVDD GSKYAYFIVEEECKNGGHE-KCKANPKKD 279

NM1 RLVLDEDKKE GGSQYRYFIVEEECHNG- YA-ACKNKLKED 282



** 



* * * * * 



PHTBP NFPVRTFPEYTPEERNAEQIPYRTEQLSAQEKTGKDRIAPNPLDYKSNSV 340 

NGTBP1 W GEDKRQT VSTRDYTGPNRFLADPLSYESRSW 312

NM1 AS VKDERKT VSTQDYTGSNRLLANPLEYGSQSW 315



* * * 



***** 



PHTBP FMKFGYHFNSS-HYLGAILEDTKQR-TISVICKRQLTIQKTILTYHLGTM 388
NGTBP1 LFRPGFRFENKRHYIGGILERTQQTFDTRDMTVPAFLTKAVFDANQKQAG 362
NM1 LFRPGWHLDN-RHYVGAVLERTQQTFDTRDMTVPAYFTSEDYVP G 359

** * ** * * 

PHTBP FMKGIIFRWLSVQAK-DPLWVAHMPCEV DERHHKRRLGF 426
NGTBP1 SLRGNGNHKYAGNHKYGGLFTSGENNAPVGAEYGTGVFYDETHTKSRYGL 412
NM1 SLKGLG--KYSGDNKAERLFVQGEGSTLQGIGYGTGVFYDERHTKNRYGV 407

****** 
PHTBP TYKY-KPENNRWLDSINSCVRALRSRCCALSKQDIELYSRLHRLHCSDYP 475

NGTBP1 EYVYTNADKDTWAD YARL SYDRQGIGLDNHFQQTHCSA-D 451

NM1 EYVYHNADKDTWAD YARL SYDRQGIDLDNRLQQTHCSH-D 446

** ** _ * _ ^ * * * *** 

PHTBP WDKNCGPTLDKSWSMYRTERNNYQEKHRVIHLEFDLALNAGQGVFLQTH 525 

NGTBP1 GSDKYCRPSADKPFSYYKSDRVIYGESHKLLQAAFKKSFDTAK IRH 497

NM1 GSDKNCRPDGNKPYSFYKSDRMIYEESRNLFQAVFKKAFDTAK IRH 492

** * * ^ * _**_^_* ** * * 

PHTBP KLNLGLGFESINSLMDHGDMTAQYTLGRLYQL PRRDPRSIWTVSLCN 572 

NGTBP1 NLSVNLGYDRFGSNLRHQDYYYQSAN-RAYSLKTPPQNNGKKTSPNGREK 546

NM1 NLSINLGYDRFKSQLSHSDYYLQNAV-QAYDLITPK KPPFPNGSKD 537

* _ _ ** *_ * * _ 

PHTBP NT RATLNCDAL-NLGIRLYLRCCLINQLNNPRYGSVLFQFGTRV 615 

NGTBP1 NPYWVSIGRGNWTRQICLFGNNTYTDCTPRSINGKSYYAAV--RDNVRL 594

NM1 NPYRVSIGKTTVNTSPICRFGNNTYTDCTPRNIGGNGYYAAV--QDNVRL 585

*. *.* * . .. *..* ...*. 

PHTBP HRTWTPTSLGELPSIRAMAHYVNHHPNQVFWGRGAVKHLT LLSS 659 

NGTBP1 GR-WADVGAG LRY- -DYRSTHSDDGSVSTGTHRTLSWNAGIVLKPA 637

NM1 GR-WADVGAG TRY- -DYRSTHSEDKSVSTGTHRNLSWNAGWLKPF 628

** * * ** * * *. 

PHTBP PWM-LKFAASG--RHVTLSVISG-ATDRFLVPPLILTGVNYKNES YV 702

NGTBP1 DWLDLTYRTSTGFRLPSFAEMYGWRSGDKIKAVKIDPEKSFNKEAGIVFK 687

NM1 TWMDLTYRASTGFRLPSFAEMYGWRAGESLKTLDLKPEKSFNREAGIVFK 678

* *. 

PHTBP SAI YNVDVRYCKTLY YRGQQLGDRATGQAKPDGY QLHRFA 742 

NGTBP1 GDFGNLEASWFNNAYRDLIVRGYEAQIKDGKEQVKGNP-AYLNAQSARIT 736

NM1 GDFGNLEASYFNNAYRDLIAFGYETRTQNGQTSASGDP-GYRNAQNARIA 727

* ★ * ..*.* * 

PHTBP APG RNFSYHSKKFRPAK ENTKNAESIFS 770 

NGTBP1 GINILGKIDWNGVWDKLPEGWYSTFAYNRVRVRDIKKRADRTDIQSHLFD 786

NM1 GINILGKIDWHGVWGGLPDGLYSTLAYNRIKVKDADIRADRTFVTSYLFD 777

aa ••••••••• 

PHTBP A FFVGSNGLHTNSKSCFNGRLHEPIPYFFNFLRNVPRFNEYHCCC 815 

NGTBP1 AIQPSRYWGSGYDQPEGKWGVNGMLT YSKAKEITELLGSR 827

NM1 AVQPSRYVLGLGYDHPDGIWGINTMFT YSKAKSVDELLGSQ 818

* _** .... *... 






FIGURE 9C 



PHTBP TSLIAASILLHHIYHWVFDFRYYYFV--YFCWILHHLIHINSFLMLLSHY 863 

NGTBP1 -ALLNGNSRNTKATARRTRPWYIVDVSGYYTVKKHFTLRAGVYNLLNHRY 876

NM1 -ALLNGNANAKKAASRRTRPWYVTDVSGYYNIKKHLTLRAGVYNLLNYRY 867

_ _ _* * *_ * * _* 



PHTBP --REWYLTCCACAFNIVTVNGF CVGCCSNI LAEMKF 898 

NGTBP1 VTWENVRQTAAGAVNQHKNVGVYNRYAAPGRNYTFSLEMKF 917

NM1 VTWENVRQTAGGAVNQHKNVGVYNRYAAPGRNYTFSLEMKF 908

* * * * * **** 
FIGURE 10A



MIMKYHHFRYSTVALTVLFALSHSYGAATENKKIEENNDLAVLDEVIVTE 

MHFKLNPY ALAFTSLFLVACSGGKGSFD LEDVRPNQ 

MHFKLNPY ALAFTSLFLVACSGGKGSFD LEDVRPNK 

*.*... ..*,***..**... *..* 

SHYAHERQNEVTGLGKWKNYHEMSK- -NQILG- - IRDLTRYDPGISWE 
TAKAEKATTSYQDEETKKKTKEELDKLMEPALGYETQILRRNKAPKTETG 

TTGVSKEE - - YKDVETAKKEKEQLGELMEPALGYWKV PVSSFE 

* ** 

QGRGASSGYAIRGVDKNRVSLLVDGLPQAHSYHTLSGDANGGAINEI-EY 
EKRNER- - - -WELSEDKITKLYQESVEIIPH- -LDELNGKTTSNDVYHS 
NKK VDISD lEVITNGNLDDVPYKANSSK-YNY 



ENIRSIELS KGAS S AE YGS GAHGGA I GFRTKDAQ -DIIKEG 

HDSKRLDKNRDLKYVRSGYVYDGSFNEIRRNDSGFHVFKQGIDGYVYYLG 

PDIK--TKDSSLQYVRSGYVIDGEHSGSNEK GYVYYKG 

* * 



QHWGLDSKTSYASKNSHFLQIAAAGEAGGFEALVIATHRH 

VTPSKELPKGKVISYKGTWDFVSNINLEREIDGFDTSGDGKNVSATSITE 

NSPAKELPVNQLLTYTGSWDFTSNANLNNE EGRPNYLNDDYYTK 

* * 

• • • •••• ■•• • 

- -GKETKI -HSEANKLKHNIRRITGFENRYDFTQIPHRMPPGGSFFIVED 
TVNRDHKVGEKLGDNEVKGVAHSSEFAVDFDNKKLTGSLYRNG--YINRN 

FIGK--RVGLVSGDAKPAKHKYTSQFEVDFATKKMTGKL SD 

* 

• •••••••••• • 

TCPTLDCTPRARVKLNRDNFPVRTFPEYTPEERNAEQIPYRTEQ--LSAQ 

KAQEVTKRYS I EADI AGNRFRG KAKAEKAGDPI FT - DSNYLEGG 

KEKTI YTVNADIRGNRFTGAATASDKNKGKGESYNFFSADSQSLEGG 

EKTGKDRIAPNPLDYKSNSVFMKFGYHFNSSHYLGAILEDTKQRTISVIC 
FYGPKAEE^4AGKFFTNNKSLFAVFAAKSENGETTTERIIDATKIDLTQFN 
F YGPKAEE^4AGKF VANDKS LFAVFSAKHNGSNVNTVR 1 1 DASKI DLTNFS 
* * * * * 

• • ••• •••• a • 

KRQLTI - -QKTILTYHLGTMFMKGI IFRWLSVQAKDPLWVAHMPCEVDER 
AKELNNFGDAS VL I I DGQKIDLAGVNFKNSKTVE INGKTMVAVACCSNLE 
I SELNNFGDAS VL I I DGKKI KLAGSGFTNKHTI EINGKTMVAVACCSNLE 
^ ...* * * ... ....* 
FIGURE 10B

HHKRRLGFTYKYKPENNRWLDS INSCVRALRSRCCALSKQDIELY 463 

YMK FGQLWQKEGKQQVKDNSLFLQGERTATDKMPAGGNYKYVGTWD 467

YMK FGQLWQQAEGGKPENNSLFLQGERTATDKMPKGGNYKYIGTWD 421 



PHTBP1 SRLHRLHCSDYPWDKNCGPTLDKSWSMYRTERNNYQEKATCHSFCI 510

APL ALVSKGTNWIAEADNNRESGYRTEFDVNFSDKKVNGKLFDKGGVNPVFTV 517 

APL7 AQVSKENNWVATADDDRKAGYRTEFDVDFGNKNLSGKLFDKNGVNPVFTV 471 



PHTBP1 LKALNAGQGVFKQTHKLNLGLGFESNLIRLTIIGIILPNIPKAGYTSYRG 560
APL DATIN-GNGFIGSAKTSDSGFALDAGSSQHGNAVFSDIKVNGGFYGP--T 564 

APL7 DAKID-GNGFTGKAKTSDEGFALDSGSSRYENVKFNDVAVSGGFYGP--T 518 



PHTBP1 RGRLDNPYIYRRDPRSIETVSLCMNTRATLLLLRYNKGIRLLLR 604

APL AGELGGQFHHKSDNGSVGAV FGAKRQIE K 593 

APL7 AAELGGQFHHKSENGSVGAV FGAKQQVK K 547 
FIGURE 11

[Plot of PHTBP, NGTBP1, NM1, APL and APL7 against PERCENT SIMILARITY; axis ticks 20 to 100 in steps of 10]
FIGURE 12



TonB box

PHTBP1 DEVIVTE 22
CIRP ETMVVTA 12
FEPAP DTIVVTA 17
FHUAP DTITVTA 13
FECAP FTLSVDA 30



2

PHTBP1 YAIRGVD KNRVSLLVDG 121
CIRP VSIRGLDSS YTLILVDG 105
FEPAP IDIRGMGPE NTLILIDG 102
FHUAP LIIRGFAAEGQSQMNYL-NG 141
FECAP FGIRGLNPRLTSRSTVUIDG 199
*♦* . ♦ ♦



3

PHTBP1 IELSKGASSAEYGSGAHGGAIGFRTKD 177
CIRP IEVVRGPMSSLYGSDALGGVVNIITKK 159
FEPAP IEVLRGPARARYGNGAAGGVVNIITKK 180
FHUAP AEIMRGPVSVLYGKSSPGGLLNMVSKR 188
FECAP IDVVRGGGAVRYGPQSVGGVVNFVTRA 251



4

PHTBP1 FKQTHKLNLGLGF
CIRP PETSESWELGLYY
FEPAP
FHUAP
FECAP
FIGURE 14
FIGURE 16
FIGURE 17A

i) h44 
FIGURE 19A FIGURE 19B FIGURE 19C
FIGURE 20A



ORF rnase T 



tbpB 



tbpA



1 kb 



1168 

ATTTTTTATCTAATCTAAAAACAAGCGTTTCCACCTTGGAAATGATO 

CTGAAACTAATAAAGTTCAAACCIISi^TTAAGTTTATAT^^ 

-35 -10 ^ 

TTATTATTTTATAA ATTAAAGGAGA CATTATqTTTA A APTTAa a ^CTan^mvpT^TA 

SD Meti 



"1309 



CTG, 



tbpB. 



... GAAAAATAATCATAATTCCCCTTTGCTGGTTGTAG
^^3063 

ATAGCAAGCGGGCAATTTTTTATAAAAATTTGCAAAATTTA AATAAAGGAGA ^ 

SD 

ATCTAATGATAATGAAATATCATCATTTTCGC tbpA AGAAA

Met 



3151 

TTTCAGTTTAGCATTTGAAATGAAGTT TTAG 



5943 



FIGURE 20B 
FIGURE 21



ATGATAATGi^TATCATGATTTTCGCTATTCACCTGTTGCGT^ 

TTTCTCATTCATACGGTGCTGCGACTC 

TGTTCTGGATGAAGTTATTGTGACAGAGAGCCATTATG^ 

ACTGGCTTGGGGAAAGTAGTGAAAAATTATCACGAAATGAGTAAAAATCAAAl^ 
TTCGTGATTTAACTCGCTATGACCCTGGTATTTCGGTGGT^ 

TAGTGGCTATGCCATTCGAGGTGTAGATAAAAACCGTGTCAGCTTACT^ ^ 

ccacAagcgcacagttatcatacgctaggttcagatgctaatggtggtgcaat^ 
ttgagtatgaaaacattcgttcaattgagttaagcaaaggagca^ 
ctctggtgcgcatggtggtgctattggttttcgtactaaagatg^
gaggggcagcattggggctta^atagtaagacctcttatgcc?v(k^ 

TACAGTCTATCGCAGCGGCTGGTGAGGCGGGTGGTTTTC 

CCGACACGGTAAAGAGACCAAAATTCATTCCGAGGCAAATAAATTAAAACATAATATTCG^ 

CGTATAACCGGCTTTGAAAATCGCTACGACTTTACCCAAATTCCGCACAG 

AGGATCTCCTTTTAATTGTGGAAGATACTTGCCCAACATTAGATTC 

GGTTAAGTTGAACCGCGATAATTTCCCAGTGAGAACATTTCCGGAATATACG 

CGCAAACAGCTTGAGCAGATTCCTTATCGCACTGAGCAGCTCTCAGCCCAAGAATATACCG 

GTAAAGATCGCATTGCACCAAACCCTTTAGATTACAAGAGTAATTCTGTT^ 

TGGCTATCACTTCAACTCGTCTCATTATCTTGGCGCAATCT^ 

TACGATATCCGTGATATGCAAACGCCAGCTTACTATACAAAAGACGATATTAACTTATCAC 

TTAGGAACTATGTTTATGAAGGGGATAATATTTTAGATGGCTTAGTGT^^ 

CCCTTATGGGTTGCGCTATAGCCATGTGAAGTTTTTTGATC 

TTAGGATTCACCTATAAATATAAACCAGAGAATAATCGCTGGTTGGATAC^^ 

GTGCGGATAAACAAGATATTGAACTATATAGCCGGCTACATCGCTTGCAT^ 

TCCTGTGGTAGATAAAAATTGCCGCCCGACTTTGGATAAATCT^^ 

GAGCGTAATAATTACCAAGAAAAGCATCGTGTCATTCATTTAGAATTTGATA 

ATGCTGGTCAAGGCGTATTTAACCAAACCCACAAACTGAATTTAGGGTl^^ 

ATTTAATTCGCTTATGGATCATGGGGATATGACTGCCC^ 

AGCTACCGCGGTAGAGGGCGTTTAGATAATCCATATATTTATCGCCGCGATC 

TTGAAACGGTATCTTTGTGTAATAATACACGCGGCGACATCTTAAACTC 

AATTAAAGGCGATAGCCATTTTGTTAGCTTCCGCGATCTAGTGATAAGCGAGTATG 

TTGGGATTAGGGGTGCGTTTTGATCAACATCGATTTAAAT^ 

GCCGAACTTATCGAAATTGGTCTTGGAATGGTGGGATTACGCT^ 

ATCGCTTTCTTATCGCATTTCAAACGGTTTTAGAGTGCCTG^ 

AAACGTGATCATATTGGGCTTAAAGATAACGAATATGTGCAACGCGCGCAACG^^ 

AGTTAGAGCCAGAAAAATCGACTAATCATGAGATTGGAGTTAGCTTTAAAGGT^ 

TTACCTTGATGTGAGCTATTTCCGTAATAACTATAAAAATATGATT^ 

AGAATAATACAAAAATCACACTGTTTCTATAACTACCATAATATTCAAGATGTAGCACT^ 

ACGGGATAAATTTAGTCGCTAAATTTGACTTACACGGTATTTTATCTATGCTGCCAGAT^ 

TTTTT ATTCATCAGTTGCTTATAACCGTGTAAAAGTAAAAGAGCGGAAACTAACCGACTC 

AGACTCGATAGCGTAAACGATCCTATTCTAGATGCGATTCAGCCAGCACGCTATGTGCTTG 

GATTCGGCTACGATCACCCAGAAGAAAAATGGGGAATTGGCATTACTACCAC 

AGCCAAAAACGCCGATGAGGTGGCAGGCACACGTCATCACGGNATACATCGCGTTGAT^ 

GGTGGCAAACTGACCGGTTCTTGGTACACCCATGATATTACCGGTTACATCAA^ 

ACTACACCTTACGTGGAGGAATTTATAATGTGACTAATCGTAAATATTC 

AGTGCGCCAATCCGGTGTGAATGCAGTAAACCAAGACCGGGGTAGCAATTACACTCGAfTTT^^ 

GGCGCTCCGGbGAGAAATTTtAGTTTAGCA^TTTGAAATC 
ATGTTTAAACTTAAAAGTAGTTTTGTACTGCTTAATGCGGCGCTACTTGCTGCTTGTTCCT
CAAATGGTGGAAGCTTTGATGTTCAATCTGCCAAAGTTGAATCTCAAACGCAAACTACCCC 
CAAAAAGCCAAGTTTACAAGATGATAATAGTAACGCAAGACGTACAGTAAGCGCTTCTGAA 
ACTGAAGCTTTATTGCAGCCGGGGTTTGGTTTTTCAGCCAAAATTCCGCGTCGTAATCTCC 
TTCCGCAGGGGAAGGAAGATGTAGCCCCTATTGGTGATATAAAAGAGATTACTGGAGATCT 
GCCAAAAATTCCGTATGAAGAAGAGGTTAAAGCGTGCGGTAGTAGTGCTGATGGATTTAGC 
CATACTCATGATAGAAATCATAAGTTGTATACAAGAGATTTTAATTTTGTTCGTTCCGGCT 
ATGTTGTGCATTCTGGTCCAAAACCTGAAATAAAGCCTAAAGAAATTTTGAGAACAGGTGC 
ACATGGGTATGTTTACTATTTAGGTATAGAGCCGCCCAAAGCAATACCTACCCAAAAACTA 
ACTTATAAAGGATATTGGGATTTTACTACCTATGCGGCTAAGGGGAGAGATAGTAATATTT 
TTCTAATTCCCGCAGGCATCAATAGTGGCGCCATACCGGAAAATAGTCACGATATTAATGT 
TGATGATTCTGAAAAACCAATGGGGCATACAGGAGAATTTACGGCTGATTTTGCTAATAAA 
ACTTTAACTGGAACATTGGTTCGTAATGGGTATGTTAGTCGTAGCAAAGAGCAAAAAATTA 
CAACAATTTACGATATTGATGCGAAAATTAAAGGTAATCGCTTTTCTGGTAAAGCAAACCC 
AAAAAAACCGATGATCCTTATTTTTGGGAAAAGCTCCACGACACTTGAAGGTGGATTTTTT 
GGTGGGGAGGCTCAAGAACTTGCCGGTAAATTCTTAGCTGATGATAAGTCGGTATTTGTTG 
TTTTTGCTGGCACACGAGATGCTAAAAAAGATGATAGTGAATCTGCCTTTGATGCTTTCCC 
AATTAAACTTAAAGATTTAAATAAATCTGAGATGGATACTTTCGGGAATGCGACACATTTG 
ATTATTAACAATAAGCAGATTCCACTTATTGCGGAAGCCACAAAAAGCTTTGCCGAGATGA 
AATTTGATGATTTGGTTACCCGTACTATTGATGGAAAAACGTATCGAGTTTCAGTCTGCTG 
TAATAATTTAGATTATGTCAAATTTGGGATTTATAGCGAGGGAAATAATAGTGATACTGCT 
CTCCAAGAATATTTAGTAGGAGAACGTACAGCTCTGGCAGATTTGCCAACAGGGACAGTAA 
AATATCGAGGTACTTGGGACGGGGTAATGTACAGTAAATCTGGCTCGGCAGGGGTTGAATC 
GCCAAGTAACAGCGAAAGTGGTACTCGTTCACTATTCGATGTAGATTTTGTCAATAAAAAA 
ATTAATGGCAAGCTGATTGCTAATGATGGTGTTGAAGAACGCCCAATGCTGACACTGGAAG 
GCAATCTGAAAGGGAATGGTTTTGGAGGCACAGCCAAAACGGGCAATTCTGGTTTTAATCT 
TGATCCCAAAAGTACGAATGGTGGCACGGTAGGGCATATAAATACTCAATTTGAAGGGGGC 
TTTTATGGCCCTAAGGCGACGGAATTAGGTGGTATTGTACAAAATACAGAAACGGATAAAG 
ATAGAGTCAGTATTACATTCGGCGGAAAACGTCAAATAGAAAAATAA 
FIGURE 24
FIGURE 27
FIGURE 29A